Mdl Multiple Hypothesis Testing
نویسندگان
چکیده
This paper examines the problem of simultaneously testing many independent multiple hypotheses within the minimum encoding framework. We introduce an efficient coding scheme for nominating the accepted hypotheses in addition to compressing the data given these hypotheses. This formulation reveals an interesting connection between multiple hypothesis testing and mixture modelling with the class labels corresponding to the accepted hypotheses in each test. An advantage of the resulting method is that it provides a posterior distribution over the space of tested hypotheses which may be easily integrated into decision theoretic post-testing analysis.
منابع مشابه
An MDL Approach for Multiple Low Observable Track Initiation∗
In this paper the track initiation problem is formulated as multiple composite hypothesis testing using maximum likelihood estimation with probabilistic data association (ML-PDA), an algorithm known to work under very low SNR conditions. This algorithm does not have to make a decision as to which measurement is target originated. The hypothesis testing is based on the minimum description length...
متن کاملP 4: The Hypothesis Detect Multiple Sclerosis in Early Stage with Saliva Testing
Introduction: Recent studies point to the clinical and research efficacy of saliva as a respected diagnostic aid for observing Multiple Sclerosis. The objectives of this Hypothesis are to identify novel biomarkers recognized to Multiple Sclerosis in early stage in saliva and to determine if the levels of these markers correlate with level of these Cerebrospinal fluid and blood assays and urine ...
متن کاملHypothesis Selection and Testing by the MDL Principle
The central idea of the MDL (Minimum Description Length) principle is to represent a class of models (hypotheses) by a universal model capable of imitating the behavior of any model in the class. The principle calls for a model class whose representative assigns the largest probability or density to the observed data. Two examples of universal models for parametric classesM are the normalized m...
متن کاملMinimum description length methods of medium-scale simultaneous inference
Nonparametric statistical methods developed for analyzing data for high numbers of genes, SNPs, or other biological features tend to overfit data with smaller numbers of features such as proteins, metabolites, or, when expression is measured with conventional instruments, genes. For this medium-scale inference problem, the minimum description length (MDL) framework quantifies the amount of info...
متن کاملLogic Program Induction using MDL and MAP: An Application to Grammars
Probabilistic programs provide an appealing language for describing mental theories, because they are Turing complete: any computable process may be described as a program. Program induction is the problem of inferring theories, in the form of (probabilistic) programs, that describe some set of observations. Minimum Description Length, or MDL, is one common approach to program induction [11]. T...
متن کامل